課程資訊
課程名稱
強化學習
Reinforcement Learning 
開課學期
112-1 
授課對象
電機資訊學院  電機工程學研究所  
授課教師
孫紹華 
課號
CommE5069 
課程識別碼
942 U0830 
班次
 
學分
3.0 
全/半年
半年 
必/選修
選修 
上課時間
星期四2,3,4(9:10~12:10) 
上課地點
電二229 
備註
總人數上限:80人 
 
課程簡介影片
 
核心能力關聯
核心能力與課程規劃關聯圖
課程大綱
為確保您我的權利,請尊重智慧財產權及不得非法影印
課程概述

加簽表單:https://docs.google.com/forms/d/e/1FAIpQLSfzGZsk6-7aVWkBP2DBYVnTAIGrCzvo4l6hMmjesjjL4c62kA/viewform?usp=sf_link

Reinforcement learning (RL) is a subfield of machine learning concerned with developing algorithms that can learn to make decisions by interacting with the environment. This course will cover the fundamentals of reinforcement learning. We will also explore more advanced topics, such as deep reinforcement learning (deep RL) algorithms, imitation learning, hierarchical RL, meta-RL, programmatic RL, etc.

Throughout the course, students will implement and experiment with various RL algorithms, including Q-learning, policy gradient, and actor-critic methods. We will use popular RL frameworks, such as OpenAI Gym, to build and train RL models for a range of applications, such as games, robotics, and recommendation systems.

By the end of the course, students will have a solid understanding of the core concepts of reinforcement learning and the ability to apply these concepts to real-world problems. They will also be familiar with current research trends in RL and have the skills to continue learning and experimenting with RL algorithms beyond the scope of the course. 

課程目標
Upon completing this course, students will be able to formulate reinforcement learning problems as well as design, apply, and implement reinforcement learning algorithms to solve the problems. 
課程要求
The prerequisites of this course include the following:
- Solid background in machine learning and mathematics
- Strong implementation skills and familiarity with deep learning frameworks, e.g., PyTorch, TensorFlow 
預期每週課後學習時數
 
Office Hours
另約時間 
指定閱讀
 
參考書目
Reinforcement Learning: An Introduction second edition. Richard S. Sutton and Andrew G. Barto. 
評量方式
(僅供參考)
 
No.
項目
百分比
說明
1. 
Assignment 1 
20% 
 
2. 
Assignment 2 
20% 
 
3. 
Assignment 3 
20% 
 
4. 
Final Project 
40% 
Open-ended group project (4 students per group) 
 
課程進度
週次
日期
單元主題
無資料